To APPEAR in ApJ 

Preprint typeset using LAT^X style cmulatcapj v. 11/10/09 



O 

(N 

£ 

O 

u 

6 

in 

C3 



> 

O 

00 



O 



X 
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ABSTRACT 

We present the results of a systematic search for massive black hole binaries in the Sloan Digital 
Sky Survey spectroscopic database. We focus on bound binaries, under the assumption that one of 
the black holes is active. In this framework, the broad lines associated to the accreting black hole are 
expected to show systematic velocity shifts with respect to the narrow lines, which trace the rest-frame 
of the galaxy. For a sample of 54 586 quasars and 3 929 galaxies at redshifts 0.1 < z < 1.5 we brute- 
force model each spectrum as a mixture of two quasars at two different redshifts. The spectral model 
is a data-driven dimensionality reduction of the SDSS quasar spectra based on a matrix factorization. 
We identified 32 objects with peculiar spectra. Nine of them can be interpreted as black hole binaries. 
This doubles the number of known black hole binary candidates. We also report on the discovery of a 
new class of extreme double-peaked emitters with exceptionally broad and faint Balmer lines. For all 
the interesting sources, we present detailed analysis of the spectra, and discuss possible interpretations. 
Subject headings: methods: data analysis; methods: statisticals; quasars: general; quasars: emission 
lines 



1. INTRODUCTION 

Massive black hole (BH) pairs are the natural outcome 
of mergers through the hierarchical formation of galax- 
ies. Examples of unbound BH pairs, with separations of 
< 1 kpc, have been observed, as in t he prototypical case 
of NGC 6240 (|Komossa et all I20031 see Colpi & Dotti 
2009 for a recent review). At separations of ~ few pc 
the two BHs start experiencing their own gravitational 
interaction, binding in a BH binary (BHB). Observing 
BHBs is challenging, since they cannot be spatially re- 
solved in optical and X-ray. The only spatially resolved 
BHB candi date to date is hosted by the elliptical galax y 
0402+379 (jManess et all [20041 : iRodriguez etall I2006I ). 
The two flat-spectrum radio sources, corresponding to 
the two components of the candidate BHB, have a pro- 
jected separation of w 7 pc. At the distance of 0402+379 
(z = 0.055) this corresponds to few milliarcsec, an an- 
gular scale that can be probed only through radio in- 
terferomet ry. A second BHB c andidate is the BL Lac 
OJ287 (see fValtonen et al"1l2008l . and references therein). 
It shows a as 12 yr modulation in its light curve, that 
has been interpreted as related to the orbital period of a 
BHB lurking in the nucleus of the AGN. 

All the other BHB candidates discussed to date 
have been identified by studying their optical and 
near-infrared spectra. In a BHB scenario, the broad 
lines (BLs) emitted by gas bound to each BH may be 
red- or blue-shifted with respect to their host galaxy 
rcdshift, as a consequence of the Keplerian m otion of 
the binary (jBegelman. Blandford fe Reesl 119801 ) . Fur- 
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thermore, the BL region of each BH can be perturbed 
and stripped by the gravitational potential of the 
companion, resulting in peculiar flux ratios between 
BLs with different ionization potential (|Montuori et al.l 
20101 ). This spectroscopic approach does not suffer any 
angular resolution limitations: Actually, the closer (and 
more massive) the binary is, the more shifted/deformed 
the BLs are. Thanks to the existence of large spectro- 
scopic surveys, such as the Sloan Digital Sky Survey 
(SDSS), a large region of the sky can be probed. Up 
to date five spectroscopically iden tified candidates have 
been pr esented: J0927+2943 (IKomossa. Zhou fe Lul 
2008t IBogdanovic. Eracleous fe Sigurdssonl 120091: 
Dotti et all I2009D . J1536+0441 dBoroson fe Lauerl 
2009), J1050+3456 (IShields et all 120091) . 4C+22.25 
(J1000+2233 in this paper iDecarli et al.l |2010H). and 
J0932+0318 (jBarrows et al.ll201lD . Such a small number 
of objects is marginally compatible with the theoreti- 
cally predicted number of sub-parsec BHBs at z < 0.7 
(5-10, given the merger rate and under reasonable 
ass umptions on the bin ary lifetime and observability; 
see IVolonteri et"aIll2009D . 

The spectroscopic approach has an obvious drawback: 
a peculiar spectrum does not guarantee the presence of 
a BHB in the nucleus of the host. As an example, an 
unobscured BHB with both BHs active could resam- 
ble the spectrum of a dou ble peaked emitter (see, e.g., 
lEracleous fc Halpernlll994l) , where broad double-peaked 
lines are emitted because of the almost edge-on, disk- 
like structure of the BL region of a single BH. A binary 
with a single accreting BH would show a single shifted 
BL. If the shift corresponds to a relatively small veloc- 
ity along the line of sight (< 4 000 km s _1 ), the same 
signature could be emitted by a remnant of a binary co- 
alescence, recoiling because of anisotropic gravit ational 
wave emission (e.g. IKomossa. Zhou fc Lul 1200 8 P~l. Fi- 

5 If the galaxy merger is gas ri ch, the maximum recoil velocity 
is expected to be < 100 km s~ 1 (Bogdanovic. Reynolds & Miller 
120071 ; ITtotti et al.ll20"T(il ; IVolonteri et al . 2010; Kcsdcn etaTtolOl) . 



2 



Tsalmantza et al. 



nally, both the cases can be reproduced by a chance su- 
perposition of two AGN (or an AGN-galaxy superposi- 
tion) withi n the angular resoluti on of the used spectro- 
graph (e.g. iHeckman et al.ll2009T ). The simplest way to 
discriminate between these scenarios and the BHB hy- 
pothesis would be to look for a periodic oscillation of the 
BL shifts around the host galaxy redshift. However, the 
orbital pe riod of the binary could be too long to be easily 
observed (Bcgclman, Blandford & Rccs 1980). For these 
reasons other possible explanations have been proposed 
for all the BHB candidates discussed in literature. 

To overcome the paucity of BHB candidates and the 
uncertainties related to their interpretation, we depict 
two ways: 

Theoretically a better description of the spectrum of 
BHB is needed, in order to identify other character- 
istic signatures of binaries. Few attempt as been 
made up to date (see, e.g. IBogdanovic et al.l 120081: 
Bogdanovic. Eracleous fc Sigurdssonll2009l : iShen fe Loebl 
20101: iMontuori et al.l l2010). 

Observationally all the BHB candidates lurking in large 
spectroscopic catalogue must be identified, through a 
meticulous study of all the possible spectra, in order to 
allow for follow-up studies on a significant sample of ob- 
jects. 

In this paper we explore the second path, describing 
the results we obtained from a comprehensive search of 
BHBs in the SDSS DR7. The code we use, described 
in Section [21 automatically detects sources with a spec- 
trum consistent with a BHB, a double peaked emitter, a 
superposition, or a recoiling BH. We present all the pecu- 
liar objects we find in Section [31 where, for each object, 
we compare our results with previous findings available 
in the literature. Conclusions are drawn in Section [4l 
Throughout the paper we will assume a standard cos- 
mology with Ho = 70 km s _1 Mpc -1 , fi m = 0.3 and 
fl A = 0.7. 

2. THE METHOD 

2.1. Present analysis 

In this study we perform an automatic and system- 
atic search for BHB candidates in SDSS catalogue, look- 
ing for composite spectra of two sources with a velocity 
difference up to 30 000 km s — 1 . More specifically, we 
look for the presence of two sets of emission lines (one 
broad and one narrow) with a small separation between 
them, caused by the Keplerian rotation of one compo- 
nent of the binary sy stem. For this purpose we use the 
method described in iTsalmantza fc Hogd (|2011h . As a 
first step we extract a small set of components that can 
sufficiently reconstruct the SDSS QSO spectra by using 
HMF (Heteroscedastic Matrix Factorization or HMF), 
a bilinear model optimized with probabilistically justi- 
fied weighted least-squares objective function, analogous 
to principal components analysis but making use of the 
observational noise model. The method uses a subset 
of data at rest-frame wavelengths, described in detail in 
Section (|2.2j) . as a training set to define the set of basis 
functions that minimize the scalar Xe- 

In this case, a significant shift between the different sets of lines 
would not be compatible with a recoiling BH. 
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where the first term corresponds to the total \ 2 of the 
fit of the training data points with errors o~ij , by a set 
of K components gkj and coefficients , over all the N 
spectra and the M pixels of the training set. The second 
term corresponds to a smoothing prior or regularization 
that prefers small pixel-to-pixel variations in each result- 
ing component. The strength of the smoothing is set by 
the scalar e. An iterative procedure is used to minimize 
In each step, we fix g^j and estimate the optimal a^fc, 
and then hold the fixed and estimate the optimal g^j ■ 
The procedure is repeated until the solution converges. 
We initialize the fitting with the output of PCA when 
applied to an extended set of training spectra. 

We then use the resulting set of components to fit each 
observed spectrum at the redshift provided by SDSS. Af- 
ter that, we repeat the fitting using two sets of compo- 
nents at different redshifts. Here one set is assumed to 
be at the SDSS redshift, while the second is free to vary 
over a broad range of z, corresponding to velocity differ- 
ences up to 30 000 km s _1 . If a second redshift system is 
present, we expect the fit to significantly improve when 
we add the second set of components. This procedure 
has already been proven to successfully identify four of 
the known BHB candidates (J0927 +2943, J1536+0441 
J1050 +3456, and J1000+2233, see ITsalmantza fc Hogg! 
1201 lh with an estimate of the velocity shifts between the 
two sets of lines consistent with what obtained in previ- 
ous studies. 

2.2. The training sample 

To train the HMF we use the same training set of 
quasa r spectra that was used in ITsalmantza fc Hogd 
(|2011h . The sample consists of spectra in the redshift 
range 0.1-1.5. However, since we are mainly interested in 
detecting shifts between the narrow and the broad emis- 
sion lines in the QSO spectra, we check if our method is 
more sensitive in detecting interesting objects when one 
of the two sets of components used to fit each spectrum 
is representative of spectra with only narrow emission 
lines. To define components that include only narrow 
lines, we use galaxy spectra for the training of the HMF. 
The galaxy sample consists of spectra classified spectro- 
scopically as galaxies in SDSS with S/N>20 and EW of 
the [On] and [Oin] lines larger than 20 (10 856 spectra). 

All spectra used for the training and the applica- 
tion of the method are derived from the 7th Data 
Release (DR7) of SDSS. Pixels with any of the flags: 
SP.MASK.FULLREJECT, SP_MASK_BRIGHTSKY, 
SP.MASK.NODATA, SP.MASK.NOSKY or pixels that 
correspond to zero noise were treated as masked. All 
the spectra and their noise were moved to the rest-frame 
(assuming z = zsdss)j resulting in spectra with different 
spectral coverage. However, both HMF and PCA (the 
output of which was used as an initialization to the 
HMF) require common wavelengths for all the training 
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spectra. The common wavelength range was defined 
by the pixels that included information for at least 
10 spectra of each type (i.e. galaxies or QSOs). The 
final wavelength coverage was 1522.299-8352.183 A for 
the QSOs and 3044.388-9193.905 A for the galaxies, 
corresponding to 7394 and 4801 pixels respectively. As a 
last step before the application of the method all spectra 
were interpolated to common wavelengths selected for 
each type of source using cubic splines. The selected 
wavelengths are uniformly distributed in log space as 
in the case of the original SDSS wavelengths. For the 
resulting spectra we interpolated linearly the values of 
the masked pixels. 

PCA was performed separately for the QSO and the 
galaxy sample, using a number of spectra equal to the 
number of pixels selected for each source. The training 
spectra were first projected into a subspace orthogonal 
to the mean spectrum of the data set and the flux in 
each spectral bin was divided with the RMS of the noise 
in that bin, for all the non-masked pixels in the train- 
ing sample. The PCA results were used as an initializa- 
tion to the HMF. The method was run for a subset of 
approximately 1 000 spectra of each type for a different 
number of components and for 16 iterations, until con- 
vergency. This test was also performed for four different 
values (1,3,10 and 30) of the smoothing scalar e. To per- 
form a simple cross validation, the resulting components 
were also used to fit another subset of 1 000 test spectra 
for each type. Based on these tests, we decided to use 
14 and 7 components for the QSOs and galaxies and e 
values of 10 and 1 respectively. Those numbers were also 
defined by using the spectra of the known BHB candi- 
dates and testing the ability of the components to detect 
them. 

2.3. Selection criteria 

We apply our newly developed fitting scheme to all 
the 54 586 QSO spectra of SDSS in the redshift range 
0.1-1.5. Addition ally, since one of the k nown BHB candi- 
dates (4C+22.25. lDecarli et ai1l2010bft is classified spec- 
troscopically as a galaxy in the SDSS catalog, we also 
applied the method to objects with redshift from 0.2-1.5, 
that are classified by SDSS as galaxies and have fiber 
magnitudes that correspond to u — g < 0.8, g — r < 1.5, 
r— i < 1.0 and g < 21. We note that the majority of these 
sources (3 518 out of 3 929) were targeted as quasar can- 
didates by SDSS. For each spectrum the fitting was per- 
formed for all the combinations of the extracted QSO and 
galaxy components: i) two QSO components for both the 
spectral components (hereafter QSO-QSO), ii) a QSO 
component for the spectrum at the SDSS redshift and a 
galaxy components at the second redshift (QSO-Galaxy) 
and iii) the opposite of case (ii) (Galaxy-QSO). Out of 
the 175,545 fitting results we assigned priority based on 
the following criteria: 

i- The fit significantly improves by adding a set of 
components at a second redshift, i.e. it corresponds 
to large \ 2 difference. The threshold was set based 
on the values extracted for the four known candi- 
dates and by visual inspection of the fitting results 
for various \ 2 difference values. 

ii- The peaks of the \ 2 difference do not correspond 



to fits with unphysical properties (e.g. negative 
emission lines). Negative residuals are common 
when fitting the spectra with one set of compo- 
nents. Their strength, and therefore their impact 
in the results, can vary from very weak features 
usually caused by details in the continuum fitting 
and the presence of noise, to very strong features 
caused by poor fitting of emission lines. To exclude 
the latter cases (but not the former) from our final 
sample we examine only peaks of the \ 2 differences 
that correspond to positive differences between the 
99.5% and the 0.5% quartiles of the distribution of 
fluxes per pixel in each fitted spectrum (for each 
set of components). 

iii- The peaks of the x 2 difference correspond to a dif- 
ference in redshift between the two sources larger 
than 0.01. 

In the case that Galaxy-QSO fitting was performed 
to the spectra, there was a lot of contamination to our 
results due to two additional reasons: 

i- In most of the cases the fitting of the narrow emis- 
sion lines at the SDSS redshift by the galaxy com- 
ponents was resulting to unusual residuals of the 
broad lines, (e.g. in the case that no significant 
shift was present between the NLs and BLs), which 
were then fitted very poorly by the second set of 
components. To exclude these objects from our 
final sample we also measured the % 2 value of the 
fit by the second set of components restricted to an 
area of 500 A around the Ha line and 200 A around 
the H/3 line. Peaks corresponding to a very large 
value of x 2 a t the areas of the BLs were excluded. 

ii- In all cases the fitting of a QSO spectrum by the 
galaxy components will improve significantly when 
the QSO components are added to the fit. That 
is due to the fit of the broad emission lines by the 
second set of components. To make sure that the 
observed improvements were not only caused by 
this, we re-perform the fitting of the spectrum by 
two sets of QSO components, at the redshifts sug- 
gested by the Galaxy-QSO fitting. 

The fits of the spectra selected using the above crite- 
ria were visually inspected independently by two of the 
authors (PT and RD). The fitting results showed that 
there is a lot of contamination, e.g. from obvious Double 
Peaked Emitters (DPEs), misclassified objects in SDSS 
or spectra with wrong estimated redshifts. By excluding 
all those cases, the most interesting objects were iden- 
tified. In Figure [1] we provide an example of the fitting 
output for one of those sources. 

The objects selected here are presented in detail in the 
following section. We should point out that all the ob- 
jects selected for their interesting features are at redshifts 
below 0.8. For higher redshifts the [O in] line is not in- 
cluded in the spectra and the differences in the fitting are 
detected mainly based on the [O ii] line, which in most of 
the cases is very faint or not well detected. For this rea- 
son no reliable information for the narrow emission lines 
are included in most of the spectra at higher redshifts. 
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TABLE 1 

Summary of the peculiar objects found with our code. (1) Quasar name. (2-3) Right ascension and declination (J2000.). (4) Modified 
Julian Date of the SDSS observation. (5) Plate. (6) Fiber. (7) Redshift of Narrow Lines. (8) Rcdshift of broad lines. (9) Method used to 

select the target. QG: Quasar-Galaxy; QQ - Quasar— Quasar; GQ: Galaxy-Quasar (see the text for the method description). (10) 
Classification: A- Asymmetric BL profile; B- Black hole binary candidate; D- Double-peaked emitter; E- Extreme double-peaked emitter; 

O- Others (see section [3j) . 



Obj.name R.A. Dec. MJD Plate Fiber znl z bl Method Class. 



(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


(10) 


J0012-1022 


00:12:24.03 


-10:22:26.3 


52141 


0651 


072 


0.228 


0.221 


QG,QQ 


A 


J0155-0857 


01:55:30.02 


-08:57:04.0 


52168 


0665 


597 


0.165 


0.170 


QQ 





J0221+0101 


02:21:13.15 


+01:01:02.9 


51869 


0406 


374 


0.354 


0.364 


QG.QQ 





J0829+2728 


08:29:30.60 


+27:28:22.7 


52932 


1267 


066 


0.321 


0.325 


GQ 





J0918+3156 


09:18:33.82 


+31:56:21.2 


52990 


1592 


139 


0.452 


0.457 


QQ,GQ 


D 


J0919+1108 


09:19:30.32 


+11:08:54.0 


53050 


1740 


399 


0.369 


0.372 


QQ 





J0921+3835 


09:21:16.13 


+38:35:37.6 


52731 


1214 


293 


0.187 


0.182 


QQ 


A 


J0927+2943 


09:27:12.65 


+29:43:44.1 


53389 


1939 


467 


0.713 


0.698 


QG.QQ 


B 


J0931+3204 


09:31:39.05 


+32:04:00.2 


53386 


1941 


553 


0.226 


0.226 


QQ.GQ 


O 


J0932+0318 


09:32:01.60 


+03:18:58.7 


52254 


0568 


039 


0.420 


0.401 


QQ 


B,D 


J0936+5331 


09:36:53.85 


+53:31:26.9 


52281 


0768 


473 


0.228 


0.237 


QQ 


A 


J0942+0900 


09:42:15.12 


+09:00:15.8 


52757 


1305 


281 


0.213 


0.168 


GQ 


E 


J0946+0139 


09:46:03.95 


+01:39:23.7 


51989 


0480 


480 


0.220 


0.227 


QQ 


A 


J1000+2233 


10:00:21.80 


+22:33:18.6 


53737 


2298 


102 


0.419 


0.377 


GQ 


E,B 


J1010+3725 


10:10:34.28 


+37:25:14.8 


52993 


1426 


110 


0.282 


0.276 


QG.QQ 


O 


J1012+2613 


10:12:26.86 


+26:13:27.3 


53757 


2347 


513 


0.378 


0.351 


QQ 


E,B 


J1027+6050 


10:27:38.54 


+60:50:16.5 


52375 


0772 


216 


0.332 


0.301 


QQ 


E 


J1050+3456 


10:50:41.36 


+34:56:31.4 


53431 


2025 


603 


0.272 


0.258 


QQ.GQ 


B 


J1105+0414 


11:05:39.64 


+04:14:48.2 


52356 


0581 


226 


0.436 


0.406 


GQ 


E 


J1117+6741 


11:17:13.91 


+67:41:22.7 


51942 


0491 


402 


0.248 


0.253 


GQ 





J1154+0134 


11:54:49.42 


+01:34:43.6 


52051 


0515 


099 


0.469 


0.450 


QQ 


A,B 


J1207+0604 


12:07:55.83 


+06:04:02.8 


52376 


0842 


530 


0.136 


0.128 


GQ 


O 


J1211+4647 


12:11:13.97 


+46:47:12.0 


53116 


1449 


001 


0.294 


0.287 


QQ,GQ 





J1215+4146 


12:15:22.78 


+41:46:21.0 


53120 


1450 


111 


0.196 


0.206 


QQ,GQ 





J1216+4159 


12:16:09.60 


+41:59:28.4 


53120 


1450 


130 


0.242 


0.233 


GQ 





J1328-0129 


13:28:34.15 


-01:29:17.6 


52426 


0911 


333 


0.151 


0.140 


QQ.GQ 





J1414+1658 


14:14:42.03 


+16:58:07.2 


54523 


2758 


014 


0.237 


0.242 


QQ 





J1440+3319 


14:40:05.31 


+33:19:44.5 


53498 


1646 


283 


0.179 


0.165 


GQ 


D 


J1536+0441 


15:36:36.22 


+04:41:27.0 


54567 


1836 


270 


0.389 


0.373 


QG.QQ 


D,B 


J1539+3333 


15:39:08.09 


+33:33:28.0 


52823 


1355 


572 


0.226 


0.199 


QG.QQ 


0,B 


J1652+3123 


16:52:55.90 


+31:23:43.8 


52790 


1343 


593 


0.593 


0.590 


QG,QQ,GQ 


O 


J1714+3327 


17:14:48.51 


+33:27:38.3 


54591 


2973 


190 


0.181 


0.186 


QQ 


Q,B 



3. RESULTS 

Our selection produced a list of 32 candidates of par- 
ticular relevance (see Table [T]). For each target, we re- 
analyzed the SDSS spectrum, modeling it with a power 
law for the QSO continuum emission, a host galaxy tem- 
plate at the redshift of the NLs and a template o f the 
iron complex, a s desc ribed in IDecarli et al.l (|2010af) and 
iDe Rosaet al. (2011). We fitted the broad components 
of Mgn, H/3 and Ha with 2 gaussian functions at the 
same peak. This fitting approach aims to better con- 
strain the peak wavelength, and is not meant to repro- 
duce the line profile in detail. Narrow lines are masked 
when fitting the broad component; by construction of our 
sample, there is a velocity offset between BLs and NLs. 
This simplifies the measurement of the peak wavelengths 
of the two components. 

Peak wavelengths are then converted into velocity 
shifts: 



Continuum-subtracted velocity plots of all the interesting 
targets are shown in Figure [2] 



A rough classification scheme was set according to: 1) 
the magnitude of the velocity shift, in particular when 
comparing Mgn and Balmer lines; 2) the presence of 
strong asymmetries in the line profiles; 3) the occurrence 
of secondary bumps or peaks; 4) additional information 
from other emission lines or from the SDSS images. We 
define five classes of objects, namely; i) black hole bi- 
nary candidates, which are expected to show similar 
veloc ity shifts for all the BLs, and a variety of line profiles 
(e.g,. IShen kTo^Mh . ii) quasars with asymmetric 
line profiles, with small (< 2 000 km s^ 1 ) shifts of BL 
peaks. These features are observed in some "normal" 
type-I AGN, and they are p ossibly related to a symme- 
tries in the BL region (e.g.. iBentz etaLl [2010h . They 
may also be associated to a velocity-dependent Balmer 
decrement of broad lines, in) double peaked emitters 
(DPEs) , characterized by symmetric features in line pro- 
files (e.g., a secondary peak in the red wing of the line, at 
the opposite velocity with respect to a blue-shifted peak). 
Another property of DPEs is that different lines (in par- 
ticular low- and high-ioniz ation lines) may sh ow very 
different shapes and shifts (lHalpern" "etaTI fl996h . These 
properties are usually associated to a disk-like structure 
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MJD=52823, libel ld=572. plateld=1355, z=0.1993. z 2nd=0.2263 
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2nd redshift 
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4000 5000 6000 7000 
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8000 9000 



Fig. 1. — An example of an interesting object (J1539+3333) de- 
tected by the method. Top panel: The x 2 difference between the 
fitting of the spectrum with 1 and 2 sets of components at differ- 
ent redshifts. The green and blue lines represent the SDSS redshift 
(^sdss) an d the one with the largest x 2 difference («fit)- 2nd panel: 
The fitting of the spectrum (black) with both sets of components 
(red). 3rd panel: Black line: residuals after subtracting the fit 
components z s ^ ss . The fit with the set of components at Zg t is 
shown in red. Bottom panel: Black line: residuals after sub- 
tracting the fit at 2fi t . Red line: the fit with the set of components 
at ^sdss- A color version of the plot is available in the electronic 
version of the paper. 



of the BL region (jEracleous fe Halpernl fl99l . iv) ex- 
treme double peaked emitters (see below), v) oth- 
ers, i.e. objects with small shifts or poor signal-to-noise 
spectra, preventing us from a clear interpretation, or ob- 
jects with very peculiar properties, not belonging to any 
of the aforementioned classes. 

Note that all the BHB candidates with small veloc- 
ity shifts could also be recoiling BHs , though they are 
expected to be rare r than binaries (jDotti et al.l 120091 : 
iVolonteri et"aII [2010l B l. In the following, we will not dis- 
tinguish between these two cases, including both in the 
"BHB candidates" class. 

This classification produced 9 BHB candidates, includ- 
ing the 5 known candidates. For the new 4 sources, other 
interpretations are also plausible, including an extremely 
rare case of quasar-galaxy superposition for one of them. 
Five quasars show very high velocity shifts (> 5 000 km 
s -1 ) and relatively faint lines. These objects probably 
represent extreme cases of DPEs (hereafter, they will be 
referred to as EDPEs). In the following, we discuss the 
properties of each source individually, reporting our in- 
terpretation on the nature of the object. 

J 00 12- 1022 

The Balmer broad lines of this znl = 0.228 quasar 
show a peak ~ 1700 km s _1 blue-shifted with respect 
to narrow lines. The line profile is clearly asymmet- 
ric. Ha and H/3 have identical profiles, with F(Ha) = 
2.6F(H/3). A bump in the red wing of Balmer lines sug- 
gests that this is a strongly asy mmet ric double-peake d 
emitter (see also iStrateva et al.)l2003t iShen et al.ll2010D . 

6 Note that these estimates could be affected by our incomplete 
understa nding of the orbital decay of binaries at subparsec scales 
(see, e.g. lColpi ^TDo tti 2009: Lodat o et al.ll2009h . 



though IShen fe Loebl ((2010T ) showed that the line profile 
of this source can be ascribed to an unequal mass BHB. 
Classified: asymmetric line profile. 

JO 155-0857 

The Ha line of this source shows a small (~ 1 500 km 
s _1 ) red-shift with respect to the narrow lines. A slight 
asymmetry in the line profile is reported. The asymme- 
try is clearer in the H/3 profile, which peaks at lon ger 
wavelengths (Av 2 200 km s" 1 ). IShen et al.1 (|2010D re- 
ported a blue-shift of ~ 600 km s" 1 for Ha and ~ 2 300 
km s _1 for H/3. The relatively small velocity shift and 
the difference in the profiles of Balmer lines suggest that 
this is a normal quasar. Classified: others. 

J 0221 +0101 

Both Ha and H/3 broad components of this znl = 0.354 
quasar show a ~ 1 300 km s" 1 shift with respect to the 
narrow lines. Li ne profiles a r e rath er boxy with no obvi- 
ous asymmetry. IShen et all (|2010l) report no significant 
shift for Ha, and an exceedingly pronounced shift for H/3 
(~ 3 100 km s _1 ). Classified: others. 

J0827+2728 

The Balmer lines of this quasar show a small (~ 900 
km s _1 ) red-shift with respect to the NLs, which may 
also be consistent with a strongly asymmetric line profile. 
Classified: others. 

J0918+3156 

The pec uliar properties of this object were fi rst re- 
ported by IBonning. Shields fe Salvianderl (|2007t) . The 
Mgn and H/3 broad lines are red-shifted with re- 
spect to the NLs. However, the shift is ~ 3 000 km 
s -1 for H/3 and only ~ 1050 km s" 1 for Mgli (se e 
IBonning. Shields fc Salvianderl [20071 IShen et all 120101 ). 
suggesting that the shift is due neither to a BHB nor 
a recoil. Classified: double peaked emitter. 

J 0919+ 1108 

The broad emission lines of this source are slightly 
red-shifted (700 — 1000 km s _1 , depending on the sub- 
traction of the [Nil] lines). The SDSS image of the 
quasar reveals a complex morphology, probably resulting 
by a strong gra vitational in t eracti on or a merger with a 
nearby galaxy. IShen et all (|2010l) reported a blue-shift 
for the broad component of Ha and a red-shift for H/3, 
which is not confirmed in our analysis. Classified: oth- 
ers. 

J0921+3835 

This object shows clearly asymmetric Balmer lines, 
with a peak at ~ 1 200 km s " 1 blue- wards (c onsis- 
tent with the values reported bv IShen et alJ [2~010). The 
Ha/H/3 flux ratio is ~ 3.3. An unidentified line is ob- 
served at 5812 A, corresponding to a ~ 2 300 km s _1 
shift with respect to the H/3 rest frame. A similar peak 
is not observed in the Ha profile. Classified: asym- 
metric line profile. 



6 



Tsalmantza et al. 



J6918- 3.15'6 

I Alii: 

m . 




-10 10 

Velocity [xlOOO km s" 1 ] 



Fig. 2. — Velocity diagrams of Ha (red, solid lines), H/3 (blue, dotted lines) and Mgn (green, dashed lines) for all our candidates. The 
flux of H/3 is scaled up to match the one of Ho or Mgn. 
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J0927+2943 

One of the known BHB candidates 
(iBogdanovic. Eracleous fc Sigurdssonl 120091 iDotti et all 
2009). This qu asar was first reported by 



Komossa. Zhou fe Lul (|2008f) as a recoiling BH can- 



didate. Three sets of lines are observed, two narrow 
systems at z = 0.713 and z = 0.699 and a broad 
line system consistent with the second set of narrow 
lines. The shift between the two redshifts is ~ 2 600 
km s _1 . This object has been interpre ted as a chance 
superp osition in a galaxy cluster by IHeckman et al.l 
(2009) and IShields. Bonning fc Salvianden J2009). 



However, later observational follow-ups have disproved 
the presence of a galaxy cluste r in the source field 
pecarli. Reynolds fc Dottil 120091) . Classified: black 
hole binary candidate. 

J0931+3204 

The spectrum of this quasar shows relatively broad and 
asymmetric narrow lines, which makes the determination 
of znl uncertain. The blue wing of Ha shows a bump 
at ~ 3 500 km s _1 with respect to the peak of the line. 
This feature is not observed in H/3. This object was 
not reported in p revious studies of quasars with pecu liar 
NL profiles (e.g.! Wang eFaXl [20091: ILiu et al.ll20l0allbl ). 
iShen et all (f201O) flagged this source as a DPE candi- 
date. Classified: others. 

J 0932+ 03 18 

This source shows a shift of ~ 4 000 km s -1 in the 
peak of the Mgn and the H/3 lines. IShen et al.l (|2010f ) 
reported two different velocity offsets for H/3 (~ 3 900 km 
s _1 ) and Mgn (~ 1100 km s^ 1 ) respectively, but such 
a difference is not confirmed by our analysis. The H/3 
line appears slightly asymmetric, though the line profile 
is poorly determined. This o bject has been studied in 
detail bv lBarrows et al.l ()201lD . who suggested it to be a 
DPE from an asymmetric BL region, or a BHB. Classi- 
fied: double peaked emitter or black hole binary 
candidate. 

J0936+5331 

The Balmer lines of this znl = 0.228 quasar show a 
strongly asymmetric profile, with a bright b lue peak at 
2 100 km s _1 (2250 km s _1 in the analysis by IShen et al.l 
120101) with respect to the narrow lines. The red wing 
of Ha is 3.1 x brighter than H/3; the blue wing is 4. Ox 
brighter. No peak or bump is observed in the blue wing 
at 2100 km s" 1 . This source was listed in the "DPE 
Auxiliary sample" bv lStrateva et all ([2003D . Classified: 
asymmetric line profile. 

309^2+0900 

This object has been labeled as a galaxy by the SDSS 
automatic pipeline, presumably because the broad lines 
are too flat and extended. The Balmer lines are ~ 38 000 
km s _1 broad (the broadest ever reported!) and asym- 
metric (the blue peak is brighter and peaks around 
i~ —11500 km s _1 , while the red side is fainter, flat- 
ter and extends red- wards of +20 000 km s _1 ). The 



flux ratio between Ha and H/3 is 2.6, and the lines 
have similar profile. This object is the most extreme 
DPE ever discovered in terms of line width. This object 
may also represent the prototypal to explain the peculiar 
features of other (less extreme and fainter) cases (e.g., 
J1000+2233, J1012+2613, etc). Classified: extreme 
double peaked emitter. 

J0946+0139 

The Balmer lines of this znl = 0.2203 quasar peak 
at 1550 km s" 1 b lue-wards of the NL sys tem (see also 
IShen et al.ll2010D . IBoroson fe Lauerl (|2010l ) reported an 
"anomalous H/3 profile" for this source. The broad lines 
show no obvious second peak, but the blue wings are 
slightly more extended than the red ones. Also, the flux 
ratio between Ha and H/3 is larger in the blue wing. 
Classified: asymmetric line profile. 

J 1000+2233 

This source was serendi pitously discovered by our 
group (jDecarli et al.ll2010b[ ) out of the SDSS database, 
and share some of the properties of J0942+0900. The 
Balmer lines appear faint and extremely blue-shifted 
(~ 8 700 km s _1 ). Also the Mgn line shows a blue-shift, 
but its magnitude is poorly constrained since the line 
is only partially covered by the SDSS spectrum. This 
quasar was labele d as a galaxy by the SDSS pipeline. 
IShen et all ([2010D reported inconsistent velocity offsets 
for H/3 (4 500 km s" 1 red-wards) and Mgn (2 700 km 
s^ 1 blue- wards). Classified: extreme double peaked 
emitter or black hole binary candidate. 

J 1010+3725 

This object shows a complex [O in] profile (both for the 
4959 and the 5 008 A emission lines) . Two peaks are ob- 
served, with a velocity difference of ~ 1 400 km s _1 . The 
blue peaks are fainter. Other narrow emission lines (H/3, 
Ha, [S n] ) appear normal. T his source was labeled a s 
"anomalous [O in] profile" by IBoroson fc Lauerl (120101). 
A pec uliar [O in] profile was also reported by IShen et all 
(2010). This quasar was not included in previous stud- 
ies on double-pe aked NL objects (e.g. lWang et aljr2009t 
ILiu et al.|[20l0airbh . Classified: others. 

J1012+2613 

The Balmer lines of this znl = 0.3783 quasar show 
similar properties to those of J1000+2233: The BLs peak 
~ 6 000 km s^ 1 blue-wards of the NLs. The red wing of 
Ha and the blue wing of Mgll are not covered. Classi- 
fied: extreme double peaked emitter or black hole 
binary candidate. 

J 1027+6050 



This is the only EDPE already known (Stratcva et al. 
120031) . The blue peaks of Ha and H/3 are 7 000 km s" 1 
blue-shifted wit h respect to NLs. The shift is missed in 
the analysis by IShen et al.l (|2010l ). probably because of 
the faintness of the lines. Classified: extreme double 
peaked emitter. 
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J 1050+3456 

This source was discovered bv lShields et al.l (|2009f) out 
of the SDSS database. The broad component of Balmer 
lines is clearly shifted (~ 3 400 km s" 1 bl ue-wards; a 
similar value was found by iShen et all 120101 ) . No NL is 
observed at the redshift of the BLs. Classified: black 
hole binary candidate. 

Jl 105+04U 

This quasar was found in the galaxy sample. The 
broad H/3 line peaks ~ 6 000 km s _1 blue-wards of the 
narrow component. Due to its faintness, the line profile 
is poorly constrained. The Mgn line is equally faint, but 
no obvious shift is observed, supporting the DPE inter- 
pretation for this source. Classified: extreme double 
peaked emitter. 

31111+6141 

The Ha broad component of this source appears 
slightly redshifted with respect to the NLs. The proper- 
ties of H/3 are difficult to characterize, due to its intrinsic 
faintness. Classified: others. 

J1154+0134 

The H/3 and Mgn lines of this znl = 0.469 quasar 
have identical profiles, with a peak ~ 3 500 km s^ 1 blue- 
wards of the expected wavelengths and a rather broad 
red wing. The line profiles resemble the one of other 
sources with asymmetric lines (e.g., J0012-1022), but the 
magnitude of the shift and the similarity between H/3 and 
Mgil support the BHB hypothesis. The low S/N of the 
spectrum of the SDSS spectrum hinder any conclusion 
on the nature of this source. Classified: asymmetric 
line profile or black hole binary candidate. 

J1201+0604 

The broad component of the Balmer lines of this quasar 
is rather symmetric but shifted ~ 2 500 km s _1 blue- 
wards with respect to NLs. The flux ratio between Ha 
and H/3 is ~ 4, roughly constant with respect to the line- 
of-sight velocit y. This source w as not included in the 
compilation bv lShen et al.l (|2010T ). Classified: others. 

J 121 1+4641 

The bulk of the Balmer line broad components of this 
source is shifted ~ 1 700 km s _1 blue- w ards of the NL sys - 
tem (consistent values are reported in lShen et al.ll2010l ). 
The Ha/H/3 flux ratio is ~ 4, constant along the velocity 
profile. The Ha line shows a bump at ~ 6 000 km s _1 
in the red wing, possibly revealing the DPE-like nature 
of this source. Such a feature is not clearly observed for 
H/3 because of the superposition of the [Oin] doublet. 
Classified: others. 

31215+4146 

This quasar shows a peculiar Balmer line profile. The 
bulk of Ha emission arises from a bright bump in the 
red wing. The blue side of the line may also present a 



faint wing, the actual presence of which depends on the 
continuum modeling. At zero order, the H/3 line shows 
analogous profile. However, the feature in the red wing 
is ~ If times fainter than what obse rved in Ha. The 
interp retation of this object is unclear. iBoroson fc Lauerl 
(2010) labeled this source as a 'no broad line' quasar. 
Classified: others. 

J 1216+ 4159 

The Ha broad emission of this quasar is clearly blue- 
shifted (~ 2 300 km s _1 ). The line profile shows no rel- 
evant asymmetry. The H/3 broad component is barely 
detected, its flux being ~ 7 times fainter than Ha. Clas- 
sified: others. 

J1328-0129 

The broad component of the Balmer lines in this znl = 
0.1514 quasar are blue-shifted (~ 3 100 km s _1 ) with re- 
spect to NLs. The line profile is boxy, with no significant 
asymmetry. The Ha/H/3 flux ratio is ~ 5, constant over 
the velocit y range. This obj ect was not included in the 
analysis by l Shen et al.l (|2010j l. iStrateva et all (|2003t ) and 
Bian et alJ (|2007t l listed this source as a DPE. Classi- 
fied: others. 

J14U+1658 

The bulk of the BLs of this quasars is redshifted (~ 
1200 km s^ 1 ) with respect to NLs. The red wing is 
brighter. The Ha/H/3 flux ratio is ~ 4 in the blue wing 
and around 3 in the re d wing. Thi s obje ct was labeled 
as a DPE candidate bv lShen et ail (|2010[ ). Classified: 
others. 

J1440+3319 

The Ha line of this source peaks at ~ 3 700 km s _1 
blue- wards of the NLs, and shows an extended red wing. 
The H/3 line profile is similar. The properties of this 
quasar are half the way between the objects with asym- 
metric line profiles (e.g., J1154+0134) and the typical 
DPEs, though this source has not been included in any 
previous compilation of DPEs. Classified: double 
peaked emitter. 

J1536+0441 

Th e peculiar propert i es of t his object were first reported 
bv IBoroson fc Lauerl J2009). The broad lines show two 
peaks, one consistent with the rest-frame of the galaxy as 
set by NLs, the other significantly blue-shifted (~ 3 400 
km s" 1 ). IBoroson fc Lauerl (|2009f l proposed the BHB 
interpretation for this source. However, following obser- 
vations covering the red win g of Ha revealed the p resence 
of a bump in the line wing (Chornock ct al. 2010D. a fea- 
ture commonly observed in DPEs. Classified: double 
peaked emitter or black hole binary candidate. 

J 1539+ 3333 

From the spectroscopic point of view, the properties of 
this source are similar to those of J0927+2943. The spec- 
trum presents three sets of lines at two different redshifts: 
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Broad balmer lines (driving the redshift estimate by the 
SDSS pipeline) and faint narrow lines are detected at 
z\ = 0.1993. Another set of (brighter) narrow lines is ob- 
served at Z2 = 0.2263. The corresponding velocity shift is 
~ 6 600 km s _1 . A careful inspection of the SDSS image 
of this source reveals an extended stellar wing South- 
wards of the quasar. If this belongs to the quasar host 
galaxy, then it would reveal a strongly perturbed mor- 
phology. On the other hand, it could be that this is 
a superposed galaxy. In this case, since zbl < ^2, the 
galaxy would be in the background of the quasar. This 
scenario is usually extremely unlikely, given the high ve- 
locity differences (jDotti fc Ruszkowskill2010D . However, 
the SDSS image reveals the presence of a rich galaxy 
cluster South- West of the quasar, which may enhance 
the galaxy density on the sky plain by few orders of 
magnitudes. Follow-up observations aimed at directly 
measuring the redshift of the stellar wing are needed to 
fully understand the nature of this source. Classified: 
black hole binary candidate or others. 

J1652+3123 

The R(3 and Mgn broad lines of this z NL = 0.5929 
quasar show a small blue-shift (~ 500 km s _1 ) with re- 
spect to NLs. The line profiles are similar and do not 
show any significant asymmetry. Our shift es t imate s 
are consistent with those reported bv lShen et al.l (|2010j ). 
Given the small velocity difference, this source is prob- 
ably a normal quasar (jBonning. Shields fc Salvianderl 
120071) . Classified: others. 

J 1714+3327 

The Balmer lines of this source show a clear red-shift 
(~ 1 300 km s~\ consistent with the values reported in 
iShen et aLll2010D . The line profiles are symmetric. The 
Ha/H/3 flux ratio is around 3. Classified: others or 
black hole binary candidate. 

4. SUMMARY & DISCUSSION 

We presented the outcome of our automatic and sys- 
tematic search for massive BHBs. We have found 9 
BHB candidates in the SDSS DR7. Of these, 5 have al- 
ready been extensively discussed in literature. The 4 new 
candidates are J1012+2613, J1154+0134, J1539+3333 
and J1714+3327. For each one of them a BHB is not 
the only possible explanation: The peculiar spectrum of 
J1012+2613 can be explained also as an extreme case of 
double peaked emitter; Jl 154+0134 has a too noisy spec- 
trum to exclude other explanations; J1539+3333 may 
be a rare superposition of a quasar and a galaxy; The 
small shift between broad and narrow lines in J1714+332 
(pa 1 300 km s _1 ) does not necessarily imply the presence 
of a BHB. A more detailed understanding of the expected 
spectral features of BHBs and observational follow-ups 
are needed to confirm or dismiss the BHB hypothesis for 
all the 9 candidates presented here. 

Our method also automatically detected a number of 
other interesting objects with peculiar spectral features: 

i- 4 objects show strong asymmetries in the line pro- 
files, with a peak offset > 2 000 km s _1 (either red- or 
blue-wards) and a longer wing in the opposite velocity 
range with no secondary peak. 



ii- 3 objects have BL properties analogous to what typ- 
ically observed in DPEs, even if the secondary peak is not 
pr ominent. None of the m appeared in the compilation 
bv lStrateva et all (|2003| ). 

Hi- We provide strong evidence of a new class 
of extreme double-peaked emitters, with very broad 
(FWHM>10 000 km s _1 ) and rather faint emission lines. 
The main peak of these lines show huge velocity shifts 
(> 5 000 km s _1 ) with respect to t he NLs. For a c o mpar - 
ison, only 5 objects out of 138 in IStrateva et al.l (|2003l ) 
have a shift of the brighter peak of Ha larger than 5 000 
km s" 1 , and none of them exceed 7000 km s _1 . Note 
that the "extreme double-peaked emitter" explanation is 
possible also for one of the BHB candidates already di s- 
cussed in literature (J1000+2233. lDecarli etaLl 1201 Obi) . 

iv- out of the remaining 13 quasars, 12 show clear yet 
relatively small (500-3000 km s _1 ) velocity offsets be- 
tween NLs and BLs, and no obvious asymmetries in the 
line profiles of the broad components. 

v- the case of J1539+3333 is worth of specific dis- 
cussion. This quasar has few spectral features in com- 
mon with the BHB candidate J0927+2943, with a set 
of broad and narrow emission lines shifted with respect 
to a brighter set of narrow emission lines. However, 
the velocity shift is extremely high (pa 6 600 km s _1 ). 
As a consequence, this object cannot be explained in 
terms of a recoiling BH, since the maximum kick veloc- 
ity has been constr ained with fully GR simulations to 
be < 4 000 km s~ 1 (|Baker et all 120071: iHerrmann et al. 
2007 1: iCampanelli et al.l 120071: iSchnittman fc B uonanno 
20071 lLousto fc Zlochoweril200a Ivan Meter et alj |2010). 
A natural explanation can be a superposition of an AGN 
and a galaxy within a galaxy cluster. Such model has 
been ruled out for J0927+294 3 because no galaxy cluster 
was observable in the field (jDecarli. Reynolds fc Dottl 

2009) . For J 1539+3333 a superposition is not ruled 
out by observations. The redshift of this source 
is z pa 0.2, close to the the oretically estimated 
peak of superpositions in clusters (jDotti fc Ruszkowskil 

2010) . However, the line shift corresponds to a rel- 
ative velocity between the two galaxies > 1.5 times 
larger than the maximum relative veloci ty theoreti- 
cally expected (jDotti fc Ru szkowski 2010). A bet- 
ter comprehension of this peculiar object needs a 
more detailed st udy of its field, as already perfo rmed 
for J0927+2 943 (iDecarli. Reynolds k. Dottil I2009D an d 
J1536+0441 (jDecarli et al.ll2009t iWrobel fc Laorll2009j ). 
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